GENMARK: Parallel Gene Recognition for Both DNA Strands
نویسندگان
چکیده
The problem of predicting gene locations in newly sequenced DNA is well known but still far from being successfully resolved. A novel approach to the problem based on the frame dependent (non-homogeneous) Markov chain models of protein-coding regions was previously suggested. This approach is, apparently, one of the most powerful "search by content" methods. The initial idea of the method combines the specific Markov models of coding and non-coding region together with Bayes' decision making function and allows easy generalization for employing of higher order Markov chain models. Another generalization which is described in this article allows the analysis of both DNA strands simultaneously. Currently known gene searching methods perform the analysis of the two DNA strands in turn, one after another. In doing thisall the known methods fail in teh sense that they generate false (artifactual) predition signals for the given strand when the real coding region is located on the complementary DNA strand. This common drawback is avoided by employing the Bayesian algorithm which uses an additional non-homogeneous Markov chain model of the "shadow" of the coding region-the sequence which is complementary to the protein-coding sequence.
منابع مشابه
Parallelizing Assignment Problem with DNA Strands
Background:Many problems of combinatorial optimization, which are solvable only in exponential time, are known to be Non-Deterministic Polynomial hard (NP-hard). With the advent of parallel machines, new opportunities have been emerged to develop the effective solutions for NP-hard problems. However, solving these problems in polynomial time needs massive parallel machines and ...
متن کاملTargeting individual subunits of the FokI restriction endonuclease to specific DNA strands
Many restriction endonucleases are dimers that act symmetrically at palindromic DNA sequences, with each active site cutting one strand. In contrast, FokI acts asymmetrically at a non-palindromic sequence, cutting 'top' and 'bottom' strands 9 and 13 nucleotides downstream of the site. FokI is a monomeric protein with one active site and a single monomer covers the entire recognition sequence. T...
متن کاملProduction of Cyclin D1 specific siRNAs by double strand processing for gene therapy of esophageal squamous cell carcinoma
Background: RNAi (RNA interference) is a new strategy in gene therapy and biotechnology which provides new promises in the treatment of different diseases such as cancer and viral diseases. CCND1 which is a key gene in cell cycle is amplified and over expressed in esophageal cancer. The objective of this study was production and siRNAs for CCND1, the key gene in cell cycle. Materials and Metho...
متن کاملAssembling of G-strands into novel tetra-molecular parallel G4-DNA nanostructures using avidin–biotin recognition
We describe a method for the preparation of novel long (hundreds of nanometers), uniform, inter-molecular G4-DNA molecules composed of four parallel G-strands. The only long continuous G4-DNA reported so far are intra-molecular structures made of a single G-strand. To enable a tetra-molecular assembly of the G-strands we developed a novel approach based on avidin-biotin biological recognition. ...
متن کاملCRITICA: coding region identification tool invoking comparative analysis.
Gene recognition is essential to understanding existing and future DNA sequence data. CRITICA (Coding Region Identification Tool Invoking Comparative Analysis) is a suite of programs for identifying likely protein-coding sequences in DNA by combining comparative analysis of DNA sequences with more common noncomparative methods. In the comparative component of the analysis, regions of DNA are al...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computers & Chemistry
دوره 17 شماره
صفحات -
تاریخ انتشار 1993